Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 661509 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 349.5 MiB |
| Average record size in memory | 554.0 B |
Variable types
| NUM | 7 |
|---|---|
| CAT | 6 |
Reproduction
| Analysis started | 2020-05-16 13:46:17.372178 |
|---|---|
| Analysis finished | 2020-05-16 17:00:47.450195 |
| Version | pandas-profiling v2.6.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
purchase_created_at has a high cardinality: 3494 distinct values | High cardinality |
purchase_updated_at has a high cardinality: 3494 distinct values | High cardinality |
delivery_time has a high cardinality: 1833 distinct values | High cardinality |
card_creation_date has a high cardinality: 3049 distinct values | High cardinality |
card_update_date has a high cardinality: 3049 distinct values | High cardinality |
product_ordered_quantity is highly skewed (γ1 = 21.28503473) | Skewed |
product_total_ordered_price is highly skewed (γ1 = 58.71478973) | Skewed |
purchase_created_at only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
purchase_updated_at only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
delivery_time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
card_creation_date only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
card_update_date only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
| Distinct count | 661509 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 330754 |
|---|---|
| Minimum | 0 |
| Maximum | 661508 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33075.4 |
| Q1 | 165377 |
| median | 330754 |
| Q3 | 496131 |
| 95-th percentile | 628432.6 |
| Maximum | 661508 |
| Range | 661508 |
| Interquartile range (IQR) | 330754 |
Descriptive statistics
| Standard deviation | 190961.3439 |
|---|---|
| Coefficient of variation (CV) | 0.5773515784 |
| Kurtosis | -1.2 |
| Mean | 330754 |
| Median Absolute Deviation (MAD) | 165377.25 |
| Skewness | 1.885061978e-15 |
| Sum | 2.187967478e+11 |
| Variance | 3.646623488e+10 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 661508.], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 181490 | 1 | < 0.1% | |
| 242908 | 1 | < 0.1% | |
| 232667 | 1 | < 0.1% | |
| 230618 | 1 | < 0.1% | |
| 236761 | 1 | < 0.1% | |
| 234712 | 1 | < 0.1% | |
| 257239 | 1 | < 0.1% | |
| 255190 | 1 | < 0.1% | |
| 261333 | 1 | < 0.1% | |
| Other values (661499) | 661499 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 661508 | 1 | < 0.1% | |
| 661507 | 1 | < 0.1% | |
| 661506 | 1 | < 0.1% | |
| 661505 | 1 | < 0.1% | |
| 661504 | 1 | < 0.1% |
purchase_id
Real number (ℝ≥0)
| Distinct count | 3494 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 925745.3898 |
|---|---|
| Minimum | 53326 |
| Maximum | 5305984 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 53326 |
|---|---|
| 5-th percentile | 188488 |
| Q1 | 537635 |
| median | 663624 |
| Q3 | 909924 |
| 95-th percentile | 3500907 |
| Maximum | 5305984 |
| Range | 5252658 |
| Interquartile range (IQR) | 372289 |
Descriptive statistics
| Standard deviation | 948448.2808 |
|---|---|
| Coefficient of variation (CV) | 1.024523904 |
| Kurtosis | 9.353351229 |
| Mean | 925745.3898 |
| Median Absolute Deviation (MAD) | 504100.9336 |
| Skewness | 3.146356625 |
| Sum | 6.123889071e+11 |
| Variance | 8.995541413e+11 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 53326. 53994. 55294. 55931. 56201. ... 5296877. 5296921. 5296931.5 5305464.5 5305984. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 3345789 | 2520 | 0.4% | |
| 855016 | 2160 | 0.3% | |
| 852667 | 2160 | 0.3% | |
| 854241 | 2160 | 0.3% | |
| 494006 | 2156 | 0.3% | |
| 502471 | 2100 | 0.3% | |
| 505917 | 2100 | 0.3% | |
| 1011426 | 1938 | 0.3% | |
| 574360 | 1771 | 0.3% | |
| 577625 | 1771 | 0.3% | |
| Other values (3484) | 640673 | 96.9% |
| Value | Count | Frequency (%) | |
| 53326 | 25 | < 0.1% | |
| 54662 | 55 | < 0.1% | |
| 54664 | 60 | < 0.1% | |
| 55924 | 170 | < 0.1% | |
| 55938 | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5305984 | 201 | < 0.1% | |
| 5304945 | 68 | < 0.1% | |
| 5296940 | 30 | < 0.1% | |
| 5296923 | 78 | < 0.1% | |
| 5296919 | 44 | < 0.1% |
| Distinct count | 3494 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| 2020-04-11 06:43:44.610002+00:00 | 2520 |
|---|---|
| 2020-01-24 08:28:25.153592+00:00 | 2160 |
| 2020-01-23 16:53:27.118261+00:00 | 2160 |
| 2020-01-24 13:17:56.586211+00:00 | 2160 |
| 2019-09-03 16:27:58.484448+00:00 | 2156 |
| Other values (3489) |
| Value | Count | Frequency (%) | |
| 2020-04-11 06:43:44.610002+00:00 | 2520 | 0.4% | |
| 2020-01-24 08:28:25.153592+00:00 | 2160 | 0.3% | |
| 2020-01-23 16:53:27.118261+00:00 | 2160 | 0.3% | |
| 2020-01-24 13:17:56.586211+00:00 | 2160 | 0.3% | |
| 2019-09-03 16:27:58.484448+00:00 | 2156 | 0.3% | |
| 2019-09-07 09:14:10.933370+00:00 | 2100 | 0.3% | |
| 2019-09-09 08:37:21.599611+00:00 | 2100 | 0.3% | |
| 2020-03-10 03:53:33.391010+00:00 | 1938 | 0.3% | |
| 2019-10-13 21:10:30.848070+00:00 | 1771 | 0.3% | |
| 2019-10-09 20:15:28.172636+00:00 | 1771 | 0.3% | |
| Other values (3484) | 640673 | 96.9% |
Length
| Max length | 32 |
|---|---|
| Mean length | 32 |
| Min length | 32 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 66.7% | |
| Other_Punctuation | 2 | 13.3% | |
| Math_Symbol | 1 | 6.7% | |
| Dash_Punctuation | 1 | 6.7% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Common | 15 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
| Distinct count | 3494 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| 2020-04-21 07:13:28.354099+00:00 | 2520 |
|---|---|
| 2020-01-24 13:19:12.671439+00:00 | 2160 |
| 2020-01-24 05:00:30.174044+00:00 | 2160 |
| 2020-01-24 13:14:05.998308+00:00 | 2160 |
| 2019-09-07 08:04:24.122127+00:00 | 2156 |
| Other values (3489) |
| Value | Count | Frequency (%) | |
| 2020-04-21 07:13:28.354099+00:00 | 2520 | 0.4% | |
| 2020-01-24 13:19:12.671439+00:00 | 2160 | 0.3% | |
| 2020-01-24 05:00:30.174044+00:00 | 2160 | 0.3% | |
| 2020-01-24 13:14:05.998308+00:00 | 2160 | 0.3% | |
| 2019-09-07 08:04:24.122127+00:00 | 2156 | 0.3% | |
| 2019-09-08 19:23:17.043479+00:00 | 2100 | 0.3% | |
| 2019-09-11 19:12:21.825288+00:00 | 2100 | 0.3% | |
| 2020-03-16 13:13:09.761365+00:00 | 1938 | 0.3% | |
| 2019-10-10 19:37:28.366433+00:00 | 1771 | 0.3% | |
| 2019-10-10 15:31:34.437013+00:00 | 1771 | 0.3% | |
| Other values (3484) | 640673 | 96.9% |
Length
| Max length | 32 |
|---|---|
| Mean length | 32 |
| Min length | 32 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 66.7% | |
| Other_Punctuation | 2 | 13.3% | |
| Math_Symbol | 1 | 6.7% | |
| Dash_Punctuation | 1 | 6.7% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Common | 15 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
| Distinct count | 1833 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| 2019-10-03 05:00:00+00:00 | 5312 |
|---|---|
| 2019-11-11 06:00:00+00:00 | 4863 |
| 2019-11-08 06:00:00+00:00 | 4220 |
| 2019-12-30 06:00:00+00:00 | 3424 |
| 2019-10-05 05:00:00+00:00 | 3270 |
| Other values (1828) |
| Value | Count | Frequency (%) | |
| 2019-10-03 05:00:00+00:00 | 5312 | 0.8% | |
| 2019-11-11 06:00:00+00:00 | 4863 | 0.7% | |
| 2019-11-08 06:00:00+00:00 | 4220 | 0.6% | |
| 2019-12-30 06:00:00+00:00 | 3424 | 0.5% | |
| 2019-10-05 05:00:00+00:00 | 3270 | 0.5% | |
| 2019-10-10 11:00:00+00:00 | 3264 | 0.5% | |
| 2019-08-07 06:00:00+00:00 | 3230 | 0.5% | |
| 2020-02-27 18:00:00+00:00 | 3213 | 0.5% | |
| 2019-11-07 18:00:00+00:00 | 3165 | 0.5% | |
| 2019-11-21 06:00:00+00:00 | 3153 | 0.5% | |
| Other values (1823) | 624395 | 94.4% |
Length
| Max length | 25 |
|---|---|
| Mean length | 25 |
| Min length | 25 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 71.4% | |
| Other_Punctuation | 1 | 7.1% | |
| Math_Symbol | 1 | 7.1% | |
| Dash_Punctuation | 1 | 7.1% | |
| Space_Separator | 1 | 7.1% |
| Value | Count | Frequency (%) | |
| Common | 14 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 14 | 100.0% |
client_type
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| AND | |
|---|---|
| IOS | |
| WEB |
| Value | Count | Frequency (%) | |
| AND | 363476 | 54.9% | |
| IOS | 185029 | 28.0% | |
| WEB | 113004 | 17.1% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 9 | 100.0% |
| Value | Count | Frequency (%) | |
| Latin | 9 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 9 | 100.0% |
order_total_ordered_price
Real number (ℝ≥0)
| Distinct count | 2585 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 378.4980118 |
|---|---|
| Minimum | 50.07 |
| Maximum | 5280 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 50.07 |
|---|---|
| 5-th percentile | 117.25 |
| Q1 | 281.48 |
| median | 343.3 |
| Q3 | 448.51 |
| 95-th percentile | 688.49 |
| Maximum | 5280 |
| Range | 5229.93 |
| Interquartile range (IQR) | 167.03 |
Descriptive statistics
| Standard deviation | 228.3123864 |
|---|---|
| Coefficient of variation (CV) | 0.6032063029 |
| Kurtosis | 94.5142362 |
| Mean | 378.4980118 |
| Median Absolute Deviation (MAD) | 132.6760562 |
| Skewness | 6.457612671 |
| Sum | 250379841.3 |
| Variance | 52126.54577 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 50.07 50.08 50.21 50.24 50.395 ... 4370.55 4538.715 4732.765 4855.8 5280. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1204.71 | 6480 | 1.0% | |
| 301.08 | 5229 | 0.8% | |
| 569.96 | 4830 | 0.7% | |
| 324.85 | 4470 | 0.7% | |
| 467.25 | 4200 | 0.6% | |
| 397.1 | 4153 | 0.6% | |
| 289.84 | 4134 | 0.6% | |
| 343.3 | 3542 | 0.5% | |
| 390.29 | 3458 | 0.5% | |
| 435.25 | 3399 | 0.5% | |
| Other values (2575) | 617614 | 93.4% |
| Value | Count | Frequency (%) | |
| 50.07 | 48 | < 0.1% | |
| 50.09 | 63 | < 0.1% | |
| 50.19 | 112 | < 0.1% | |
| 50.23 | 18 | < 0.1% | |
| 50.25 | 48 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5280 | 20 | < 0.1% | |
| 4947.4 | 17 | < 0.1% | |
| 4764.2 | 66 | < 0.1% | |
| 4701.33 | 251 | < 0.1% | |
| 4376.1 | 40 | < 0.1% |
| Distinct count | 3049 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| 2019-05-20 15:46:31.895738+00:00 | 3479 |
|---|---|
| 2018-12-21 13:11:19.400910+00:00 | 1958 |
| 2020-01-12 08:35:29.533406+00:00 | 1911 |
| 2019-11-21 21:18:03.402752+00:00 | 1911 |
| 2019-11-25 12:43:25.972178+00:00 | 1911 |
| Other values (3044) |
| Value | Count | Frequency (%) | |
| 2019-05-20 15:46:31.895738+00:00 | 3479 | 0.5% | |
| 2018-12-21 13:11:19.400910+00:00 | 1958 | 0.3% | |
| 2020-01-12 08:35:29.533406+00:00 | 1911 | 0.3% | |
| 2019-11-21 21:18:03.402752+00:00 | 1911 | 0.3% | |
| 2019-11-25 12:43:25.972178+00:00 | 1911 | 0.3% | |
| 2020-03-04 00:13:43.689145+00:00 | 1911 | 0.3% | |
| 2020-01-20 17:18:04.260143+00:00 | 1911 | 0.3% | |
| 2020-01-16 16:47:21.570054+00:00 | 1911 | 0.3% | |
| 2019-12-30 05:01:53.339234+00:00 | 1911 | 0.3% | |
| 2019-12-28 09:44:08.476051+00:00 | 1911 | 0.3% | |
| Other values (3039) | 640784 | 96.9% |
Length
| Max length | 32 |
|---|---|
| Mean length | 32 |
| Min length | 32 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 66.7% | |
| Other_Punctuation | 2 | 13.3% | |
| Math_Symbol | 1 | 6.7% | |
| Dash_Punctuation | 1 | 6.7% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Common | 15 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
| Distinct count | 3049 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.0 MiB |
| 2019-11-06 11:17:21.523787+00:00 | 3479 |
|---|---|
| 2019-11-06 11:16:31.141085+00:00 | 1958 |
| 2019-12-28 09:44:08.478382+00:00 | 1911 |
| 2020-03-04 00:13:43.691848+00:00 | 1911 |
| 2019-11-21 21:18:03.405506+00:00 | 1911 |
| Other values (3044) |
| Value | Count | Frequency (%) | |
| 2019-11-06 11:17:21.523787+00:00 | 3479 | 0.5% | |
| 2019-11-06 11:16:31.141085+00:00 | 1958 | 0.3% | |
| 2019-12-28 09:44:08.478382+00:00 | 1911 | 0.3% | |
| 2020-03-04 00:13:43.691848+00:00 | 1911 | 0.3% | |
| 2019-11-21 21:18:03.405506+00:00 | 1911 | 0.3% | |
| 2020-01-12 08:35:29.536169+00:00 | 1911 | 0.3% | |
| 2020-01-20 17:18:04.262610+00:00 | 1911 | 0.3% | |
| 2019-12-11 16:13:46.463729+00:00 | 1911 | 0.3% | |
| 2020-01-16 16:47:21.572998+00:00 | 1911 | 0.3% | |
| 2019-11-27 02:13:20.756364+00:00 | 1911 | 0.3% | |
| Other values (3039) | 640784 | 96.9% |
Length
| Max length | 32 |
|---|---|
| Mean length | 32 |
| Min length | 32 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 66.7% | |
| Other_Punctuation | 2 | 13.3% | |
| Math_Symbol | 1 | 6.7% | |
| Dash_Punctuation | 1 | 6.7% | |
| Space_Separator | 1 | 6.7% |
| Value | Count | Frequency (%) | |
| Common | 15 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 15 | 100.0% |
card_country_code
Real number (ℝ≥0)
| Distinct count | 71 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 478.4148606 |
|---|---|
| Minimum | 32 |
| Maximum | 858 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 76 |
| Q1 | 250 |
| median | 591 |
| Q3 | 724 |
| 95-th percentile | 792 |
| Maximum | 858 |
| Range | 826 |
| Interquartile range (IQR) | 474 |
Descriptive statistics
| Standard deviation | 264.0516105 |
|---|---|
| Coefficient of variation (CV) | 0.5519302018 |
| Kurtosis | -1.524375844 |
| Mean | 478.4148606 |
| Median Absolute Deviation (MAD) | 243.7864823 |
| Skewness | -0.3203465518 |
| Sum | 316475736 |
| Variance | 69723.25303 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 32. 34. 38. 48. 62. ... 811. 822. 833. 849. 858.], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 724 | 200754 | 30.3% | |
| 76 | 90759 | 13.7% | |
| 250 | 78303 | 11.8% | |
| 156 | 35541 | 5.4% | |
| 608 | 25876 | 3.9% | |
| 702 | 25609 | 3.9% | |
| 470 | 18851 | 2.8% | |
| 840 | 17293 | 2.6% | |
| 276 | 13974 | 2.1% | |
| 531 | 13437 | 2.0% | |
| Other values (61) | 141112 | 21.3% |
| Value | Count | Frequency (%) | |
| 32 | 1865 | 0.3% | |
| 36 | 163 | < 0.1% | |
| 40 | 170 | < 0.1% | |
| 56 | 1554 | 0.2% | |
| 68 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 858 | 271 | < 0.1% | |
| 840 | 17293 | 2.6% | |
| 826 | 6764 | 1.0% | |
| 818 | 99 | < 0.1% | |
| 804 | 210 | < 0.1% |
product_id
Real number (ℝ≥0)
| Distinct count | 6888 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36144.60831 |
|---|---|
| Minimum | 1009 |
| Maximum | 96767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 1009 |
|---|---|
| 5-th percentile | 2856 |
| Q1 | 12107 |
| median | 34027 |
| Q3 | 56159 |
| 95-th percentile | 79443 |
| Maximum | 96767 |
| Range | 95758 |
| Interquartile range (IQR) | 44052 |
Descriptive statistics
| Standard deviation | 25336.09763 |
|---|---|
| Coefficient of variation (CV) | 0.7009647859 |
| Kurtosis | -1.134124142 |
| Mean | 36144.60831 |
| Median Absolute Deviation (MAD) | 21967.50633 |
| Skewness | 0.2983855469 |
| Sum | 2.39099837e+10 |
| Variance | 641917843 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1009. 1051.5 1233.5 1257.5 1275. ... 95537. 96258.5 96650. 96656.5 96767. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 47818 | 3759 | 0.6% | |
| 4041 | 3686 | 0.6% | |
| 5064 | 3650 | 0.6% | |
| 2811 | 3440 | 0.5% | |
| 19898 | 2795 | 0.4% | |
| 31003 | 2643 | 0.4% | |
| 79426 | 2605 | 0.4% | |
| 17132 | 2524 | 0.4% | |
| 18055 | 2363 | 0.4% | |
| 59066 | 2326 | 0.4% | |
| Other values (6878) | 631718 | 95.5% |
| Value | Count | Frequency (%) | |
| 1009 | 874 | 0.1% | |
| 1094 | 257 | < 0.1% | |
| 1219 | 211 | < 0.1% | |
| 1248 | 1 | < 0.1% | |
| 1267 | 392 | 0.1% |
| Value | Count | Frequency (%) | |
| 96767 | 17 | < 0.1% | |
| 96658 | 31 | < 0.1% | |
| 96655 | 9 | < 0.1% | |
| 96651 | 6 | < 0.1% | |
| 96649 | 8 | < 0.1% |
| Distinct count | 78 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.911951372 |
|---|---|
| Minimum | 0.005 |
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 0.005 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 200 |
| Range | 199.995 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.453269106 |
|---|---|
| Coefficient of variation (CV) | 1.28312317 |
| Kurtosis | 1042.383779 |
| Mean | 1.911951372 |
| Median Absolute Deviation (MAD) | 1.079277282 |
| Skewness | 21.28503473 |
| Sum | 1264773.04 |
| Variance | 6.018529305 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[5.00e-03 7.75e-02 1.60e-01 2.75e-01 3.50e-01 ... 4.05e+01 4.85e+01 5.20e+01 1.35e+02 2.00e+02], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 390291 | 59.0% | |
| 2 | 155840 | 23.6% | |
| 3 | 45502 | 6.9% | |
| 4 | 28530 | 4.3% | |
| 5 | 20561 | 3.1% | |
| 6 | 6704 | 1.0% | |
| 10 | 5587 | 0.8% | |
| 8 | 1950 | 0.3% | |
| 7 | 1344 | 0.2% | |
| 12 | 954 | 0.1% | |
| Other values (68) | 4246 | 0.6% |
| Value | Count | Frequency (%) | |
| 0.005 | 2 | < 0.1% | |
| 0.15 | 193 | < 0.1% | |
| 0.17 | 6 | < 0.1% | |
| 0.2 | 16 | < 0.1% | |
| 0.25 | 20 | < 0.1% |
| Value | Count | Frequency (%) | |
| 200 | 7 | < 0.1% | |
| 150 | 11 | < 0.1% | |
| 120 | 23 | < 0.1% | |
| 100 | 20 | < 0.1% | |
| 88 | 9 | < 0.1% |
| Distinct count | 2192 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.705012978 |
|---|---|
| Minimum | 0.02 |
| Maximum | 4365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.0 MiB |
Quantile statistics
| Minimum | 0.02 |
|---|---|
| 5-th percentile | 0.8 |
| Q1 | 1.55 |
| median | 2.79 |
| Q3 | 5.1 |
| 95-th percentile | 15.96 |
| Maximum | 4365 |
| Range | 4364.98 |
| Interquartile range (IQR) | 3.55 |
Descriptive statistics
| Standard deviation | 22.95244615 |
|---|---|
| Coefficient of variation (CV) | 4.023206649 |
| Kurtosis | 5968.338271 |
| Mean | 5.705012978 |
| Median Absolute Deviation (MAD) | 5.121755813 |
| Skewness | 58.71478973 |
| Sum | 3773917.43 |
| Variance | 526.8147841 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.00000e-02 1.65000e-01 2.05000e-01 2.15000e-01 2.25000e-01 ... 5.16400e+02 6.68000e+02 1.29875e+03 2.08500e+03 4.36500e+03], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 11397 | 1.7% | |
| 2 | 9649 | 1.5% | |
| 1.5 | 7878 | 1.2% | |
| 0.95 | 7015 | 1.1% | |
| 1.3 | 6762 | 1.0% | |
| 1.9 | 6623 | 1.0% | |
| 1.8 | 6436 | 1.0% | |
| 0.99 | 6302 | 1.0% | |
| 1.2 | 6292 | 1.0% | |
| 3 | 6055 | 0.9% | |
| Other values (2182) | 587100 | 88.8% |
| Value | Count | Frequency (%) | |
| 0.02 | 2 | < 0.1% | |
| 0.1 | 7 | < 0.1% | |
| 0.15 | 2 | < 0.1% | |
| 0.18 | 10 | < 0.1% | |
| 0.19 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4365 | 1 | < 0.1% | |
| 2730 | 4 | < 0.1% | |
| 2599 | 3 | < 0.1% | |
| 2182.5 | 1 | < 0.1% | |
| 1987.5 | 11 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| Unnamed: 0 | purchase_id | purchase_created_at | purchase_updated_at | delivery_time | client_type | order_total_ordered_price | card_creation_date | card_update_date | card_country_code | product_id | product_ordered_quantity | product_total_ordered_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 3415646 | 2020-04-12 00:37:18.699812+00:00 | 2020-04-12 00:37:27.205435+00:00 | 2020-04-22 09:00:00+00:00 | AND | 243.65 | 2020-04-13 01:32:51.731995+00:00 | 2020-04-13 01:32:51.734630+00:00 | 724 | 3132.0 | 8.0 | 5.28 |
| 1 | 1 | 3415646 | 2020-04-12 00:37:18.699812+00:00 | 2020-04-12 00:37:27.205435+00:00 | 2020-04-22 09:00:00+00:00 | AND | 243.65 | 2020-04-23 01:40:58.118080+00:00 | 2020-04-23 01:40:58.120512+00:00 | 724 | 3132.0 | 8.0 | 5.28 |
| 2 | 2 | 3415696 | 2020-04-12 00:39:41.039341+00:00 | 2020-04-12 00:39:43.598551+00:00 | 2020-04-22 09:00:00+00:00 | AND | 243.65 | 2020-04-13 01:32:51.731995+00:00 | 2020-04-13 01:32:51.734630+00:00 | 724 | 3132.0 | 8.0 | 5.28 |
| 3 | 3 | 3415696 | 2020-04-12 00:39:41.039341+00:00 | 2020-04-12 00:39:43.598551+00:00 | 2020-04-22 09:00:00+00:00 | AND | 243.65 | 2020-04-23 01:40:58.118080+00:00 | 2020-04-23 01:40:58.120512+00:00 | 724 | 3132.0 | 8.0 | 5.28 |
| 4 | 4 | 3414916 | 2020-04-12 00:09:13.651819+00:00 | 2020-04-22 05:00:24.316678+00:00 | 2020-04-22 19:00:00+00:00 | WEB | 429.82 | 2020-04-12 00:17:32.422326+00:00 | 2020-04-12 00:17:32.424877+00:00 | 840 | 69912.0 | 12.0 | 2.52 |
| 5 | 5 | 3416314 | 2020-04-12 01:11:01.167822+00:00 | 2020-04-12 01:11:07.516947+00:00 | 2020-04-22 14:00:00+00:00 | AND | 267.74 | 2020-04-12 01:48:13.235014+00:00 | 2020-04-12 01:48:13.237352+00:00 | 724 | 10136.0 | 2.0 | 7.08 |
| 6 | 6 | 3416314 | 2020-04-12 01:11:01.167822+00:00 | 2020-04-12 01:11:07.516947+00:00 | 2020-04-22 14:00:00+00:00 | AND | 267.74 | 2020-04-12 01:48:13.235014+00:00 | 2020-04-12 01:48:13.237352+00:00 | 724 | 3832.0 | 2.0 | 5.38 |
| 7 | 7 | 3416314 | 2020-04-12 01:11:01.167822+00:00 | 2020-04-12 01:11:07.516947+00:00 | 2020-04-22 14:00:00+00:00 | AND | 267.74 | 2020-04-12 01:48:13.235014+00:00 | 2020-04-12 01:48:13.237352+00:00 | 724 | 2814.0 | 2.0 | 6.38 |
| 8 | 8 | 3416314 | 2020-04-12 01:11:01.167822+00:00 | 2020-04-12 01:11:07.516947+00:00 | 2020-04-22 14:00:00+00:00 | AND | 267.74 | 2020-04-12 01:48:13.235014+00:00 | 2020-04-12 01:48:13.237352+00:00 | 724 | 2715.0 | 2.0 | 6.40 |
| 9 | 9 | 3416314 | 2020-04-12 01:11:01.167822+00:00 | 2020-04-12 01:11:07.516947+00:00 | 2020-04-22 14:00:00+00:00 | AND | 267.74 | 2020-04-12 01:48:13.235014+00:00 | 2020-04-12 01:48:13.237352+00:00 | 724 | 3491.0 | 2.0 | 9.40 |
Last rows
| Unnamed: 0 | purchase_id | purchase_created_at | purchase_updated_at | delivery_time | client_type | order_total_ordered_price | card_creation_date | card_update_date | card_country_code | product_id | product_ordered_quantity | product_total_ordered_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 661499 | 661499 | 467772 | 2019-08-22 13:54:35.681329+00:00 | 2019-08-23 05:56:26.909067+00:00 | 2019-08-23 05:00:00+00:00 | AND | 498.25 | 2019-09-02 10:40:08.230126+00:00 | 2019-11-06 11:18:44.047489+00:00 | 156 | 45646.0 | 1.0 | 2.99 |
| 661500 | 661500 | 467772 | 2019-08-22 13:54:35.681329+00:00 | 2019-08-23 05:56:26.909067+00:00 | 2019-08-23 05:00:00+00:00 | AND | 498.25 | 2019-09-02 11:35:23.802524+00:00 | 2019-11-06 11:18:44.126883+00:00 | 156 | 45646.0 | 1.0 | 2.99 |
| 661501 | 661501 | 414995 | 2019-07-18 06:19:28.730520+00:00 | 2019-07-19 19:33:17.013962+00:00 | 2019-07-19 17:00:00+00:00 | AND | 269.68 | 2019-04-12 21:26:18.831052+00:00 | 2019-11-06 11:17:10.600675+00:00 | 724 | 4900.0 | 1.0 | 2.99 |
| 661502 | 661502 | 414995 | 2019-07-18 06:19:28.730520+00:00 | 2019-07-19 19:33:17.013962+00:00 | 2019-07-19 17:00:00+00:00 | AND | 269.68 | 2019-04-12 21:26:18.831052+00:00 | 2019-11-06 11:17:10.600675+00:00 | 724 | 4900.0 | 1.0 | 2.99 |
| 661503 | 661503 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-09-14 08:06:40.428802+00:00 | 2019-11-06 11:18:56.185369+00:00 | 372 | 63628.0 | 1.0 | 2.99 |
| 661504 | 661504 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-08-17 02:35:45.444986+00:00 | 2019-11-06 11:18:29.600034+00:00 | 616 | 63628.0 | 1.0 | 2.99 |
| 661505 | 661505 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-08-15 19:44:43.723112+00:00 | 2019-11-06 11:18:28.794384+00:00 | 250 | 63628.0 | 1.0 | 2.99 |
| 661506 | 661506 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-09-14 07:39:29.198021+00:00 | 2019-11-06 11:18:56.178100+00:00 | 643 | 63628.0 | 1.0 | 2.99 |
| 661507 | 661507 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-09-19 23:14:50.113352+00:00 | 2019-11-06 11:19:01.335256+00:00 | 826 | 63628.0 | 1.0 | 2.99 |
| 661508 | 661508 | 468613 | 2019-08-23 01:25:46.323218+00:00 | 2019-08-24 15:08:00.287868+00:00 | 2019-08-24 14:00:00+00:00 | IOS | 578.51 | 2019-08-17 02:38:24.332875+00:00 | 2019-11-06 11:18:29.602371+00:00 | 250 | 63628.0 | 1.0 | 2.99 |